Overview
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 14446 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 63 |
| Duplicate rows (%) | 0.4% |
| Total size in memory | 1.7 MiB |
| Average record size in memory | 120.0 B |
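The derived figures in this table can be checked against the raw counts. The sketch below uses only numbers from the table; the variable names are illustrative. Multiplying the row count by the average record size gives roughly 1.65 MiB, which the report rounds to 1.7 MiB.

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 14446
n_duplicates = 63
avg_record_bytes = 120.0

# Share of rows that are exact duplicates of another row.
duplicate_pct = 100 * n_duplicates / n_rows

# Approximate in-memory size: rows x average record size, converted to MiB.
total_mib = n_rows * avg_record_bytes / 2**20

print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"total size: {total_mib:.1f} MiB")
```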
Variable types
| DateTime | 2 |
|---|---|
| Text | 4 |
| Categorical | 3 |
| Numeric | 6 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 63 (0.4%) duplicate rows | Duplicates |
| lat is highly overall correlated with merch_lat and 1 other field | High correlation |
| long is highly overall correlated with merch_long and 1 other field | High correlation |
| merch_lat is highly overall correlated with lat and 1 other field | High correlation |
| merch_long is highly overall correlated with long and 1 other field | High correlation |
| state is highly overall correlated with lat and 3 other fields | High correlation |
| is_fraud is highly imbalanced (72.3%) | Imbalance |
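The 72.3% imbalance figure is consistent with a score of one minus the normalized Shannon entropy of the class counts. The report itself does not state the formula, so treat the sketch below as an assumption that happens to reproduce the number; `imbalance_score` is a hypothetical helper, and the counts are the four is_fraud values listed in this report.

```python
import math

# Class counts for is_fraud as listed in this report (two clean labels,
# two malformed ones that each occur once).
counts = [12600, 1844, 1, 1]

def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy of the class distribution.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    With fewer than two classes the score is undefined; this sketch returns 0.0.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 0.0
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(probs))

score = imbalance_score(counts)
print(f"imbalance: {score:.1%}")
```

Note that the two malformed labels count as separate classes here, which raises the number of classes from two to four and therefore affects the score.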
Reproduction
| Analysis started | 2024-10-19 12:25:10.178243 |
|---|---|
| Analysis finished | 2024-10-19 12:25:17.845765 |
| Duration | 7.67 seconds |
| Software version | ydata-profiling v4.11.0 |
| Download configuration | config.json |
Variables
trans_date_trans_time
Date
| Distinct | 12126 |
|---|---|
| Distinct (%) | 83.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
| Minimum | 2019-01-01 00:00:00 |
|---|---|
| Maximum | 2020-12-31 23:59:00 |
merchant
Text
| Distinct | 693 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
Length
| Max length | 39 |
|---|---|
| Median length | 32 |
| Mean length | 17.514952 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | "Stokes, Christiansen and Sipes" |
|---|---|
| 2nd row | Predovic Inc |
| 3rd row | Wisozk and Sons |
| 4th row | Murray-Smitham |
| 5th row | Friesen Lt |
Words
| Value | Count | Frequency (%) |
| and | 5319 | 15.8% |
| llc | 1079 | 3.2% |
| inc | 992 | 3.0% |
| sons | 881 | 2.6% |
| lt | 762 | 2.3% |
| plc | 724 | 2.2% |
| group | 505 | 1.5% |
| greenholt | 149 | 0.4% |
| baumbach | 141 | 0.4% |
| bahringer | 131 | 0.4% |
| Other values (677) | 22901 | 68.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 20557 | 8.1% |
| n | 19719 | 7.8% |
| (space) | 19138 | 7.6% |
| a | 17620 | 7.0% |
| r | 13741 | 5.4% |
| o | 12513 | 4.9% |
| i | 12237 | 4.8% |
| t | 9843 | 3.9% |
| s | 9131 | 3.6% |
| " | 8876 | 3.5% |
| Other values (45) | 109646 | 43.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 253021 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 253021 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 253021 |
category
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
| Value | Count |
|---|---|
| grocery_pos | 1602 |
| gas_transport | 1430 |
| shopping_net | 1408 |
| shopping_pos | 1354 |
| home | 1304 |
| Other values (9) | 7348 |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 10.578707 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | grocery_net |
|---|---|
| 2nd row | shopping_net |
| 3rd row | misc_pos |
| 4th row | grocery_pos |
| 5th row | health_fitness |
Common Values
| Value | Count | Frequency (%) |
| grocery_pos | 1602 | 11.1% |
| gas_transport | 1430 | 9.9% |
| shopping_net | 1408 | 9.7% |
| shopping_pos | 1354 | 9.4% |
| home | 1304 | 9.0% |
| kids_pets | 1141 | 7.9% |
| personal_care | 990 | 6.9% |
| entertainment | 953 | 6.6% |
| health_fitness | 891 | 6.2% |
| food_dining | 870 | 6.0% |
| Other values (4) | 2503 | 17.3% |
Words
| Value | Count | Frequency (%) |
| grocery_pos | 1602 | 11.1% |
| gas_transport | 1430 | 9.9% |
| shopping_net | 1408 | 9.7% |
| shopping_pos | 1354 | 9.4% |
| home | 1304 | 9.0% |
| kids_pets | 1141 | 7.9% |
| personal_care | 990 | 6.9% |
| entertainment | 953 | 6.6% |
| health_fitness | 891 | 6.2% |
| food_dining | 870 | 6.0% |
| Other values (4) | 2503 | 17.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 16099 | 10.5% |
| e | 14230 | 9.3% |
| o | 14081 | 9.2% |
| n | 13375 | 8.8% |
| p | 12864 | 8.4% |
| _ | 11804 | 7.7% |
| t | 11730 | 7.7% |
| r | 10330 | 6.8% |
| i | 9131 | 6.0% |
| g | 7138 | 4.7% |
| Other values (10) | 32038 | 21.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 152820 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 152820 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 152820 |
amt
Real number (ℝ)
| Distinct | 9266 |
|---|---|
| Distinct (%) | 64.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 124.43007 |
| Minimum | 1 |
|---|---|
| Maximum | 3261.47 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 113.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.72 |
| Q1 | 12.08 |
| median | 51.52 |
| Q3 | 101.03 |
| 95-th percentile | 784.2175 |
| Maximum | 3261.47 |
| Range | 3260.47 |
| Interquartile range (IQR) | 88.95 |
Descriptive statistics
| Standard deviation | 231.35259 |
|---|---|
| Coefficient of variation (CV) | 1.859298 |
| Kurtosis | 16.110855 |
| Mean | 124.43007 |
| Median Absolute Deviation (MAD) | 41.82 |
| Skewness | 3.4910004 |
| Sum | 1797516.8 |
| Variance | 53524.02 |
| Monotonicity | Not monotonic |
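Several of the amt statistics are derived from others in the same tables, so they can be cross-checked with simple arithmetic. This is a small sketch using only the reported values; the names are illustrative, and the comments show the figure the report gives for each quantity.

```python
# Figures copied from the amt statistics tables.
n = 14446
mean, std = 124.43007, 231.35259
minimum, maximum = 1.0, 3261.47
q1, q3 = 12.08, 101.03

derived = {
    "range": maximum - minimum,  # report: 3260.47
    "iqr": q3 - q1,              # report: 88.95
    "cv": std / mean,            # report: 1.859298
    "variance": std ** 2,        # report: 53524.02
    "sum": mean * n,             # report: 1797516.8
}

for name, value in derived.items():
    print(f"{name}: {value:.8g}")
```

The coefficient of variation above 1 and the gap between the mean (124.43) and the median (51.52) both point to a strongly right-skewed distribution, which matches the reported skewness of about 3.49.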
Common values
| Value | Count | Frequency (%) |
| 2.77 | 11 | 0.1% |
| 1.64 | 11 | 0.1% |
| 5.24 | 11 | 0.1% |
| 4.39 | 10 | 0.1% |
| 9.79 | 10 | 0.1% |
| 2.72 | 10 | 0.1% |
| 8.34 | 10 | 0.1% |
| 6.06 | 9 | 0.1% |
| 6.79 | 9 | 0.1% |
| 1.21 | 9 | 0.1% |
| Other values (9256) | 14346 | 99.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 1.01 | 6 | < 0.1% |
| 1.02 | 2 | < 0.1% |
| 1.03 | 3 | < 0.1% |
| 1.04 | 6 | < 0.1% |
| 1.05 | 6 | < 0.1% |
| 1.06 | 3 | < 0.1% |
| 1.07 | 4 | < 0.1% |
| 1.08 | 1 | < 0.1% |
| 1.09 | 5 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 3261.47 | 1 | < 0.1% |
| 3178.51 | 1 | < 0.1% |
| 3154.76 | 1 | < 0.1% |
| 2612.14 | 1 | < 0.1% |
| 2416.72 | 1 | < 0.1% |
| 1782.53 | 1 | < 0.1% |
| 1566.58 | 1 | < 0.1% |
| 1555.17 | 1 | < 0.1% |
| 1526.91 | 1 | < 0.1% |
| 1484.88 | 1 | < 0.1% |
city
Text
| Distinct | 176 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 18 |
| Mean length | 8.3976187 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Wales |
|---|---|
| 2nd row | Wales |
| 3rd row | Wales |
| 4th row | Wales |
| 5th row | Wales |
Words
| Value | Count | Frequency (%) |
| city | 565 | 3.1% |
| phoenix | 297 | 1.6% |
| san | 296 | 1.6% |
| springs | 230 | 1.3% |
| river | 214 | 1.2% |
| lake | 213 | 1.2% |
| centerview | 197 | 1.1% |
| orient | 192 | 1.1% |
| mountain | 191 | 1.0% |
| red | 188 | 1.0% |
| Other values (195) | 15652 | 85.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 11742 | 9.7% |
| a | 11095 | 9.1% |
| n | 9331 | 7.7% |
| o | 9242 | 7.6% |
| l | 7802 | 6.4% |
| r | 7319 | 6.0% |
| i | 7244 | 6.0% |
| t | 6494 | 5.4% |
| s | 4612 | 3.8% |
| d | 3885 | 3.2% |
| Other values (40) | 42546 | 35.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 121312 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 121312 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 121312 |
state
Categorical
High correlation 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
| Value | Count |
|---|---|
| CA | 3375 |
| MO | 2329 |
| NE | 1460 |
| OR | 1211 |
| WA | 1150 |
| Other values (8) | 4921 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AK |
|---|---|
| 2nd row | AK |
| 3rd row | AK |
| 4th row | AK |
| 5th row | AK |
Common Values
| Value | Count | Frequency (%) |
| CA | 3375 | 23.4% |
| MO | 2329 | 16.1% |
| NE | 1460 | 10.1% |
| OR | 1211 | 8.4% |
| WA | 1150 | 8.0% |
| WY | 1100 | 7.6% |
| NM | 1003 | 6.9% |
| CO | 856 | 5.9% |
| AZ | 673 | 4.7% |
| UT | 597 | 4.1% |
| Other values (3) | 692 | 4.8% |
Words
| Value | Count | Frequency (%) |
| ca | 3375 | 23.4% |
| mo | 2329 | 16.1% |
| ne | 1460 | 10.1% |
| or | 1211 | 8.4% |
| wa | 1150 | 8.0% |
| wy | 1100 | 7.6% |
| nm | 1003 | 6.9% |
| co | 856 | 5.9% |
| az | 673 | 4.7% |
| ut | 597 | 4.1% |
| Other values (3) | 692 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 5371 | 18.6% |
| O | 4396 | 15.2% |
| C | 4231 | 14.6% |
| M | 3332 | 11.5% |
| N | 2463 | 8.5% |
| W | 2250 | 7.8% |
| E | 1460 | 5.1% |
| R | 1211 | 4.2% |
| Y | 1100 | 3.8% |
| Z | 673 | 2.3% |
| Other values (6) | 2405 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 28892 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 28892 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 28892 |
lat
Real number (ℝ)
High correlation 
| Distinct | 183 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.787692 |
| Minimum | 20.0271 |
|---|---|
| Maximum | 66.6933 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 113.0 KiB |
Quantile statistics
| Minimum | 20.0271 |
|---|---|
| 5-th percentile | 33.3305 |
| Q1 | 36.7154 |
| median | 39.6662 |
| Q3 | 41.9404 |
| 95-th percentile | 47.4974 |
| Maximum | 66.6933 |
| Range | 46.6662 |
| Interquartile range (IQR) | 5.225 |
Descriptive statistics
| Standard deviation | 5.3170389 |
|---|---|
| Coefficient of variation (CV) | 0.13363527 |
| Kurtosis | 5.8560156 |
| Mean | 39.787692 |
| Median Absolute Deviation (MAD) | 2.9011 |
| Skewness | 0.69130654 |
| Sum | 574773 |
| Variance | 28.270903 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 38.7897 | 197 | 1.4% |
| 48.8878 | 192 | 1.3% |
| 33.5623 | 190 | 1.3% |
| 43.0048 | 187 | 1.3% |
| 41.1558 | 187 | 1.3% |
| 33.2887 | 183 | 1.3% |
| 38.9999 | 178 | 1.2% |
| 34.3795 | 169 | 1.2% |
| 43.6498 | 165 | 1.1% |
| 48.34 | 163 | 1.1% |
| Other values (173) | 12635 | 87.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 20.0271 | 109 | 0.8% |
| 20.0827 | 63 | 0.4% |
| 32.274 | 35 | 0.2% |
| 32.7185 | 41 | 0.3% |
| 32.9396 | 126 | 0.9% |
| 33.0067 | 107 | 0.7% |
| 33.2887 | 183 | 1.3% |
| 33.3305 | 105 | 0.7% |
| 33.4317 | 9 | 0.1% |
| 33.5494 | 71 | 0.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 66.6933 | 12 | 0.1% |
| 65.6899 | 36 | 0.2% |
| 64.7556 | 111 | 0.8% |
| 55.4732 | 14 | 0.1% |
| 48.8878 | 192 | 1.3% |
| 48.4786 | 119 | 0.8% |
| 48.34 | 163 | 1.1% |
| 48.0379 | 15 | 0.1% |
| 47.9657 | 40 | 0.3% |
| 47.6633 | 10 | 0.1% |
long
Real number (ℝ)
High correlation 
| Distinct | 183 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -110.87422 |
| Minimum | -165.6723 |
|---|---|
| Maximum | -89.6287 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 14446 |
| Negative (%) | 100.0% |
| Memory size | 113.0 KiB |
Quantile statistics
| Minimum | -165.6723 |
|---|---|
| 5-th percentile | -123.9743 |
| Q1 | -120.4158 |
| median | -111.0985 |
| Q3 | -101.136 |
| 95-th percentile | -91.8912 |
| Maximum | -89.6287 |
| Range | 76.0436 |
| Interquartile range (IQR) | 19.2798 |
Descriptive statistics
| Standard deviation | 12.985813 |
|---|---|
| Coefficient of variation (CV) | -0.11712202 |
| Kurtosis | 2.4767939 |
| Mean | -110.87422 |
| Median Absolute Deviation (MAD) | 9.5664 |
| Skewness | -0.83923821 |
| Sum | -1601689.1 |
| Variance | 168.63133 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| -93.8702 | 197 | 1.4% |
| -118.2105 | 192 | 1.3% |
| -112.0559 | 190 | 1.3% |
| -108.8964 | 187 | 1.3% |
| -101.136 | 187 | 1.3% |
| -111.0985 | 183 | 1.3% |
| -109.615 | 178 | 1.2% |
| -118.523 | 169 | 1.2% |
| -116.4306 | 165 | 1.1% |
| -122.3456 | 163 | 1.1% |
| Other values (173) | 12635 | 87.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -165.6723 | 111 | 0.8% |
| -156.292 | 36 | 0.2% |
| -155.488 | 63 | 0.4% |
| -155.3697 | 109 | 0.8% |
| -153.994 | 12 | 0.1% |
| -133.1171 | 14 | 0.1% |
| -124.4409 | 64 | 0.4% |
| -124.2174 | 93 | 0.6% |
| -124.1587 | 59 | 0.4% |
| -124.1437 | 95 | 0.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
| -89.6287 | 90 | 0.6% |
| -90.2848 | 8 | 0.1% |
| -90.2907 | 36 | 0.2% |
| -90.387 | 126 | 0.9% |
| -90.4504 | 45 | 0.3% |
| -90.5255 | 73 | 0.5% |
| -90.9362 | 24 | 0.2% |
| -91.0243 | 156 | 1.1% |
| -91.0664 | 33 | 0.2% |
| -91.4867 | 49 | 0.3% |
city_pop
Real number (ℝ)
| Distinct | 174 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 106537 |
| Minimum | 46 |
|---|---|
| Maximum | 2383912 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 113.0 KiB |
Quantile statistics
| Minimum | 46 |
|---|---|
| 5-th percentile | 85 |
| Q1 | 493 |
| median | 1645 |
| Q3 | 35439 |
| 95-th percentile | 841711 |
| Maximum | 2383912 |
| Range | 2383866 |
| Interquartile range (IQR) | 34946 |
Descriptive statistics
| Standard deviation | 290291.61 |
|---|---|
| Coefficient of variation (CV) | 2.7247961 |
| Kurtosis | 14.772032 |
| Mean | 106537 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 3.6583766 |
| Sum | 1.5390335 × 10⁹ |
| Variance | 8.4269218 × 10¹⁰ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 1312922 | 297 | 2.1% |
| 241 | 282 | 2.0% |
| 2368 | 197 | 1.4% |
| 149 | 192 | 1.3% |
| 1789 | 187 | 1.3% |
| 1645 | 187 | 1.3% |
| 2872 | 183 | 1.3% |
| 46 | 178 | 1.2% |
| 34882 | 169 | 1.2% |
| 84106 | 165 | 1.1% |
| Other values (164) | 12409 | 85.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 46 | 178 | 1.2% |
| 49 | 72 | 0.5% |
| 60 | 72 | 0.5% |
| 61 | 124 | 0.9% |
| 73 | 126 | 0.9% |
| 85 | 163 | 1.1% |
| 100 | 82 | 0.6% |
| 104 | 34 | 0.2% |
| 110 | 59 | 0.4% |
| 121 | 19 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2383912 | 29 | 0.2% |
| 1312922 | 297 | 2.1% |
| 1241364 | 148 | 1.0% |
| 973849 | 148 | 1.0% |
| 927396 | 45 | 0.3% |
| 841711 | 73 | 0.5% |
| 837792 | 19 | 0.1% |
| 757530 | 109 | 0.8% |
| 641349 | 81 | 0.6% |
| 545147 | 120 | 0.8% |
job
Text
| Distinct | 163 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 33 |
| Mean length | 21.130901 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | "Administrator, education" |
|---|---|
| 2nd row | "Administrator, education" |
| 3rd row | "Administrator, education" |
| 4th row | "Administrator, education" |
| 5th row | "Administrator, education" |
Words
| Value | Count | Frequency (%) |
| engineer | 1699 | 5.2% |
| officer | 1196 | 3.6% |
| surveyor | 964 | 2.9% |
| manager | 804 | 2.5% |
| scientist | 726 | 2.2% |
| education | 694 | 2.1% |
| research | 542 | 1.7% |
| therapist | 527 | 1.6% |
| analyst | 513 | 1.6% |
| land/geomatics | 465 | 1.4% |
| Other values (209) | 24638 | 75.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 32382 | 10.6% |
| i | 25292 | 8.3% |
| r | 22629 | 7.4% |
| t | 21181 | 6.9% |
| n | 21083 | 6.9% |
| a | 20227 | 6.6% |
| (space) | 18322 | 6.0% |
| o | 17270 | 5.7% |
| c | 16197 | 5.3% |
| s | 15966 | 5.2% |
| Other values (41) | 94708 | 31.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 305257 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 305257 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 305257 |
dob
Date
| Distinct | 187 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
| Minimum | 1927-09-09 00:00:00 |
|---|---|
| Maximum | 2001-07-26 00:00:00 |
trans_num
Text
| Distinct | 14383 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Unique
| Unique | 14320 |
|---|---|
| Unique (%) | 99.1% |
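The distinct and unique counts for trans_num can be reconciled with the 63 duplicate rows reported in the overview. The following sketch does the arithmetic with the reported figures only; the variable names are illustrative.

```python
# Figures copied from the overview and the trans_num tables.
n_rows = 14446
distinct = 14383  # number of different trans_num values
unique = 14320    # values that occur exactly once

# Values that occur more than once.
repeated_values = distinct - unique

# Rows that carry one of those repeated values.
rows_in_repeats = n_rows - unique

# Rows beyond the first occurrence of each value.
surplus_rows = n_rows - distinct

print(repeated_values, rows_in_repeats, surplus_rows)
```

With these figures, 63 identifiers account for 126 rows, so each repeated identifier appears exactly twice, and the 63 surplus rows match the duplicate-row count in the dataset statistics.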
Sample
| 1st row | a3806e984cec6ac0096d8184c64ad3a1 |
|---|---|
| 2nd row | a59185fe1b9ccf21323f581d7477573f |
| 3rd row | 86ba3a888b42cd3925881fa34177b4e0 |
| 4th row | 3a068fe1d856f0ecedbed33e4b5f4496 |
| 5th row | 891cdd1191028759dc20dc224347a0ff |
Words
| Value | Count | Frequency (%) |
| f1edc60904bafa8aac00a0f5e9026d0c | 2 | < 0.1% |
| 9bc5cb494abc3af2b02ca33e0d076f74 | 2 | < 0.1% |
| 7d7d61dc3b301c78ca3c0cf73e8ed72e | 2 | < 0.1% |
| 049087fe5d27b77c7238fa46bb18c99d | 2 | < 0.1% |
| fdc202f9f1dd556a51775c6d8060c58d | 2 | < 0.1% |
| dec7f564c518a3f5878016461d766ffa | 2 | < 0.1% |
| 4d7e567247b6c4529ce4c32c03b2f040 | 2 | < 0.1% |
| b87c92d4824758e704da572891697fed | 2 | < 0.1% |
| 98649f992d93377c62c09e51a1bd2cc9 | 2 | < 0.1% |
| 5fbe827807ec9f557f6242bb48db0e51 | 2 | < 0.1% |
| Other values (14373) | 14426 | 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| b | 29238 | 6.3% |
| 5 | 29118 | 6.3% |
| 8 | 29057 | 6.3% |
| d | 29019 | 6.3% |
| 6 | 28981 | 6.3% |
| e | 28914 | 6.3% |
| 9 | 28900 | 6.3% |
| 4 | 28900 | 6.3% |
| c | 28887 | 6.2% |
| a | 28868 | 6.2% |
| Other values (6) | 172390 | 37.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 462272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 462272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 462272 |
merch_lat
Real number (ℝ)
High correlation 
| Distinct | 14376 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.787991 |
| Minimum | 19.032689 |
|---|---|
| Maximum | 67.510267 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 113.0 KiB |
Quantile statistics
| Minimum | 19.032689 |
|---|---|
| 5-th percentile | 33.086286 |
| Q1 | 36.794655 |
| median | 39.620953 |
| Q3 | 42.27574 |
| 95-th percentile | 47.832952 |
| Maximum | 67.510267 |
| Range | 48.477578 |
| Interquartile range (IQR) | 5.4810852 |
Descriptive statistics
| Standard deviation | 5.3605934 |
|---|---|
| Coefficient of variation (CV) | 0.13472893 |
| Kurtosis | 5.748108 |
| Mean | 39.787991 |
| Median Absolute Deviation (MAD) | 2.7316305 |
| Skewness | 0.68076238 |
| Sum | 574777.32 |
| Variance | 28.735962 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 38.998205 | 2 | < 0.1% |
| 39.164469 | 2 | < 0.1% |
| 38.211376 | 2 | < 0.1% |
| 38.748484 | 2 | < 0.1% |
| 38.376171 | 2 | < 0.1% |
| 38.893999 | 2 | < 0.1% |
| 38.435536 | 2 | < 0.1% |
| 39.42207 | 2 | < 0.1% |
| 39.157345 | 2 | < 0.1% |
| 40.701366 | 2 | < 0.1% |
| Other values (14366) | 14426 | 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 19.032689 | 1 | < 0.1% |
| 19.040141 | 1 | < 0.1% |
| 19.04251 | 1 | < 0.1% |
| 19.070393 | 1 | < 0.1% |
| 19.101256 | 1 | < 0.1% |
| 19.140535 | 1 | < 0.1% |
| 19.161782 | 1 | < 0.1% |
| 19.165823 | 1 | < 0.1% |
| 19.167279 | 1 | < 0.1% |
| 19.169435 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 67.510267 | 1 | < 0.1% |
| 67.441518 | 1 | < 0.1% |
| 67.397018 | 1 | < 0.1% |
| 67.188111 | 1 | < 0.1% |
| 67.064277 | 1 | < 0.1% |
| 66.835174 | 1 | < 0.1% |
| 66.659242 | 1 | < 0.1% |
| 66.650388 | 1 | < 0.1% |
| 66.646051 | 1 | < 0.1% |
| 66.645176 | 1 | < 0.1% |
merch_long
Real number (ℝ)
High correlation 
| Distinct | 14380 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -110.87489 |
| Minimum | -166.67068 |
|---|---|
| Maximum | -88.646366 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 14446 |
| Negative (%) | 100.0% |
| Memory size | 113.0 KiB |
Quantile statistics
| Minimum | -166.67068 |
|---|---|
| 5-th percentile | -123.78017 |
| Q1 | -120.14625 |
| median | -111.19263 |
| Q3 | -100.44682 |
| 95-th percentile | -91.975339 |
| Maximum | -88.646366 |
| Range | 78.024319 |
| Interquartile range (IQR) | 19.699431 |
Descriptive statistics
| Standard deviation | 12.995596 |
|---|---|
| Coefficient of variation (CV) | -0.11720955 |
| Kurtosis | 2.4706392 |
| Mean | -110.87489 |
| Median Absolute Deviation (MAD) | 9.388988 |
| Skewness | -0.83695013 |
| Sum | -1601698.7 |
| Variance | 168.88552 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| -109.986757 | 2 | < 0.1% |
| -110.32181 | 2 | < 0.1% |
| -110.36249 | 2 | < 0.1% |
| -109.677707 | 2 | < 0.1% |
| -109.844716 | 2 | < 0.1% |
| -109.555496 | 2 | < 0.1% |
| -109.126592 | 2 | < 0.1% |
| -109.044284 | 2 | < 0.1% |
| -99.38238 | 2 | < 0.1% |
| -121.383937 | 2 | < 0.1% |
| Other values (14370) | 14426 | 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -166.670685 | 1 | < 0.1% |
| -166.629875 | 1 | < 0.1% |
| -166.625519 | 1 | < 0.1% |
| -166.596324 | 1 | < 0.1% |
| -166.573982 | 1 | < 0.1% |
| -166.550779 | 2 | < 0.1% |
| -166.539712 | 1 | < 0.1% |
| -166.522522 | 1 | < 0.1% |
| -166.414244 | 1 | < 0.1% |
| -166.410533 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| -88.646366 | 1 | < 0.1% |
| -88.651755 | 1 | < 0.1% |
| -88.720514 | 1 | < 0.1% |
| -88.742942 | 1 | < 0.1% |
| -88.795852 | 1 | < 0.1% |
| -88.825507 | 1 | < 0.1% |
| -88.825894 | 1 | < 0.1% |
| -88.872009 | 1 | < 0.1% |
| -88.901988 | 1 | < 0.1% |
| -88.927438 | 1 | < 0.1% |
is_fraud
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 113.0 KiB |
| Value | Count |
|---|---|
| 0 | 12600 |
| 1 | 1844 |
| 1"2020-12-24 16:56:24" | 1 |
| 0"2019-01-01 00:00:44" | 1 |
Length
| Max length | 22 |
|---|---|
| Median length | 1 |
| Mean length | 1.0029074 |
| Min length | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 12600 | 87.2% |
| 1 | 1844 | 12.8% |
| 1"2020-12-24 16:56:24" | 1 | < 0.1% |
| 0"2019-01-01 00:00:44" | 1 | < 0.1% |
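Two of the four is_fraud values look like a 0/1 flag fused with a quoted timestamp, which suggests a parsing problem in the source file rather than genuine extra classes. The sketch below is one possible cleanup, under the assumption that the leading digit is the intended label; `clean_label` and its regular expression are hypothetical and not part of the report or of ydata-profiling.

```python
import re

# Raw labels as they appear in the "Common Values" table.
raw_labels = ["0", "1", '1"2020-12-24 16:56:24"', '0"2019-01-01 00:00:44"']

# A single 0/1 digit, optionally followed by one double-quoted fragment.
LABEL_RE = re.compile(r'^([01])(?:"[^"]*")?$')

def clean_label(value):
    """Return 0 or 1, or None when the value matches neither expected shape."""
    match = LABEL_RE.match(value.strip())
    return int(match.group(1)) if match else None

cleaned = [clean_label(v) for v in raw_labels]
print(cleaned)
```

Values that return `None` are better inspected by hand than silently coerced. If the assumption holds, the column collapses to two classes and the imbalance alert would need to be recomputed.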
Words
| Value | Count | Frequency (%) |
| 0 | 12600 | 87.2% |
| 1 | 1844 | 12.8% |
| 1"2020-12-24 | 1 | < 0.1% |
| 16:56:24 | 1 | < 0.1% |
| 0"2019-01-01 | 1 | < 0.1% |
| 00:00:44 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12610 | 87.0% |
| 1 | 1850 | 12.8% |
| 2 | 6 | < 0.1% |
| " | 4 | < 0.1% |
| - | 4 | < 0.1% |
| 4 | 4 | < 0.1% |
| : | 4 | < 0.1% |
| (space) | 2 | < 0.1% |
| 6 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14488 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14488 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14488 |
Interactions
Correlations
| | amt | category | city_pop | is_fraud | lat | long | merch_lat | merch_long | state |
|---|---|---|---|---|---|---|---|---|---|
| amt | 1.000 | 0.196 | 0.024 | 0.381 | 0.021 | -0.005 | 0.020 | -0.007 | 0.029 |
| category | 0.196 | 1.000 | 0.009 | 0.165 | 0.024 | 0.021 | 0.019 | 0.021 | 0.020 |
| city_pop | 0.024 | 0.009 | 1.000 | 0.028 | -0.338 | -0.078 | -0.340 | -0.082 | 0.297 |
| is_fraud | 0.381 | 0.165 | 0.028 | 1.000 | 0.062 | 0.058 | 0.058 | 0.055 | 0.060 |
| lat | 0.021 | 0.024 | -0.338 | 0.062 | 1.000 | -0.179 | 0.987 | -0.179 | 0.745 |
| long | -0.005 | 0.021 | -0.078 | 0.058 | -0.179 | 1.000 | -0.176 | 0.995 | 0.750 |
| merch_lat | 0.020 | 0.019 | -0.340 | 0.058 | 0.987 | -0.176 | 1.000 | -0.176 | 0.713 |
| merch_long | -0.007 | 0.021 | -0.082 | 0.055 | -0.179 | 0.995 | -0.176 | 1.000 | 0.771 |
| state | 0.029 | 0.020 | 0.297 | 0.060 | 0.745 | 0.750 | 0.713 | 0.771 | 1.000 |
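The high-correlation alerts can be reproduced from this matrix by flagging every off-diagonal entry whose absolute value reaches a cutoff. The report does not state the cutoff; the sketch below assumes 0.5, which is consistent with the five alerts listed. `high_partners` is a hypothetical helper.

```python
# Correlation matrix copied from the table; rows and columns share this order.
names = ["amt", "category", "city_pop", "is_fraud", "lat", "long",
         "merch_lat", "merch_long", "state"]
matrix = [
    [1.000, 0.196, 0.024, 0.381, 0.021, -0.005, 0.020, -0.007, 0.029],
    [0.196, 1.000, 0.009, 0.165, 0.024, 0.021, 0.019, 0.021, 0.020],
    [0.024, 0.009, 1.000, 0.028, -0.338, -0.078, -0.340, -0.082, 0.297],
    [0.381, 0.165, 0.028, 1.000, 0.062, 0.058, 0.058, 0.055, 0.060],
    [0.021, 0.024, -0.338, 0.062, 1.000, -0.179, 0.987, -0.179, 0.745],
    [-0.005, 0.021, -0.078, 0.058, -0.179, 1.000, -0.176, 0.995, 0.750],
    [0.020, 0.019, -0.340, 0.058, 0.987, -0.176, 1.000, -0.176, 0.713],
    [-0.007, 0.021, -0.082, 0.055, -0.179, 0.995, -0.176, 1.000, 0.771],
    [0.029, 0.020, 0.297, 0.060, 0.745, 0.750, 0.713, 0.771, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff; not stated in the report

def high_partners(names, matrix, threshold):
    """Map each variable to the other variables it is highly correlated with."""
    result = {}
    for i, name in enumerate(names):
        partners = [names[j] for j, value in enumerate(matrix[i])
                    if j != i and abs(value) >= threshold]
        if partners:
            result[name] = partners
    return result

for name, partners in high_partners(names, matrix, THRESHOLD).items():
    print(f"{name}: {', '.join(partners)}")
```

Under this assumption the near-duplicate pairs are lat/merch_lat (0.987) and long/merch_long (0.995), while state is tied to all four coordinate fields. The strongest association with is_fraud is amt at 0.381, which stays below the assumed cutoff.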
Missing values
Sample
First rows
| | trans_date_trans_time | merchant | category | amt | city | state | lat | long | city_pop | job | dob | trans_num | merch_lat | merch_long | is_fraud |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 04-01-2019 00:58 | "Stokes, Christiansen and Sipes" | grocery_net | 14.37 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | a3806e984cec6ac0096d8184c64ad3a1 | 65.654142 | -164.722603 | 1 |
| 1 | 04-01-2019 15:06 | Predovic Inc | shopping_net | 966.11 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | a59185fe1b9ccf21323f581d7477573f | 65.468863 | -165.473127 | 1 |
| 2 | 04-01-2019 22:37 | Wisozk and Sons | misc_pos | 49.61 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | 86ba3a888b42cd3925881fa34177b4e0 | 65.347667 | -165.914542 | 1 |
| 3 | 04-01-2019 23:06 | Murray-Smitham | grocery_pos | 295.26 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | 3a068fe1d856f0ecedbed33e4b5f4496 | 64.445035 | -166.080207 | 1 |
| 4 | 04-01-2019 23:59 | Friesen Lt | health_fitness | 18.17 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | 891cdd1191028759dc20dc224347a0ff | 65.447094 | -165.446843 | 1 |
| 5 | 05-01-2019 03:15 | "Raynor, Reinger and Hagenes" | gas_transport | 20.45 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | ef010a5f4f570d306a050a368ee2729d | 64.088838 | -165.104078 | 1 |
| 6 | 05-01-2019 03:21 | Heller-Langosh | gas_transport | 18.19 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | 8e2d2fae5319d31c887dddbc70627ac4 | 63.917785 | -165.827621 | 1 |
| 7 | 05-01-2019 11:31 | Padberg-Welch | grocery_pos | 367.29 | Browning | MO | 40.0290 | -93.1607 | 602 | Cytogeneticist | 14-07-1954 | 5fbe827807ec9f557f6242bb48db0e51 | 39.167065 | -93.705245 | 1 |
| 8 | 05-01-2019 18:03 | McGlynn-Heathcote | misc_net | 768.15 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | fba83e0a3adb530251295ab72a96b719 | 64.623325 | -166.403973 | 1 |
| 9 | 05-01-2019 22:02 | Dooley-Thompson | misc_net | 849.49 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | b87c92d4824758e704da572891697fed | 65.266065 | -164.865352 | 1 |
Last rows
| | trans_date_trans_time | merchant | category | amt | city | state | lat | long | city_pop | job | dob | trans_num | merch_lat | merch_long | is_fraud |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 14436 | 22-01-2019 00:18 | "Connelly, Reichert and Fritsch" | gas_transport | 93.23 | Unionville | MO | 40.4815 | -92.9951 | 3805 | "Investment banker, corporate" | 15-09-1950 | 58d980b2db4f0581aaa3e62967072efa | 40.527285 | -93.859674 | 0 |
| 14437 | 22-01-2019 00:19 | "Kuhic, Bins and Pfeffe" | shopping_net | 4.65 | Eugene | OR | 44.0385 | -123.0614 | 191096 | "Scientist, physiological" | 06-04-1964 | 6f552aa7397e6e1c012c25ecfc0cc9b7 | 43.821635 | -122.497236 | 0 |
| 14438 | 22-01-2019 00:23 | Sporer Inc | gas_transport | 51.57 | Carlotta | CA | 40.5070 | -123.9743 | 1139 | "Therapist, occupational" | 15-01-1951 | c10ca0af6656b71e6da577da9db6c8c3 | 40.556556 | -124.887658 | 0 |
| 14439 | 22-01-2019 00:32 | "Willms, Kris and Bergnaum" | shopping_pos | 145.60 | Athena | OR | 45.8289 | -118.4971 | 1302 | Dealer | 18-10-1976 | c206b545e52a142e1009fb0bd3e3f2ac | 46.592719 | -118.002289 | 0 |
| 14440 | 22-01-2019 00:37 | Wiza LLC | misc_pos | 37.92 | Syracuse | MO | 38.6547 | -92.8929 | 628 | "Radiographer, diagnostic" | 18-12-1961 | a98a9e2ca6a7c605c34a4298be3ad606 | 39.245730 | -92.441388 | 0 |
| 14441 | 22-01-2019 00:37 | Hudson-Grady | shopping_pos | 122.00 | Athena | OR | 45.8289 | -118.4971 | 1302 | Dealer | 18-10-1976 | 699a4c06b22711bf3e0d8ef91232d356 | 46.442439 | -118.524214 | 0 |
| 14442 | 22-01-2019 00:41 | "Nienow, Ankunding and Collie" | misc_pos | 9.07 | Gardiner | OR | 43.7857 | -124.1437 | 260 | "Engineer, maintenance" | 01-09-1956 | 080d620d24815c7d6c637cf0b71dde8e | 42.901265 | -124.995317 | 0 |
| 14443 | 22-01-2019 00:42 | Pacocha-O'Reilly | grocery_pos | 104.84 | Alva | WY | 44.6873 | -104.4414 | 110 | "Administrator, local government" | 16-05-1973 | 3c346c8cd627c5fe3ed57430db2e9ae7 | 45.538062 | -104.542117 | 0 |
| 14444 | 22-01-2019 00:48 | "Bins, Balistreri and Beatty" | shopping_pos | 268.16 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | e66ffcc95ba7fc490486242af1205d04 | 64.081462 | -165.898698 | 0 |
| 14445 | 22-01-2019 00:55 | Daugherty-Thompson | food_dining | 50.09 | Unionville | MO | 40.4815 | -92.9951 | 3805 | "Investment banker, corporate" | 15-09-1950 | 65e7370f473f9b9d75796c8033a7c929 | 40.387243 | -92.224871 | 0 |
Duplicate rows
Most frequently occurring
| | trans_date_trans_time | merchant | category | amt | city | state | lat | long | city_pop | job | dob | trans_num | merch_lat | merch_long | is_fraud | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 04-01-2019 00:58 | "Stokes, Christiansen and Sipes" | grocery_net | 14.37 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | a3806e984cec6ac0096d8184c64ad3a1 | 65.654142 | -164.722603 | 1 | 2 |
| 1 | 04-01-2019 15:06 | Predovic Inc | shopping_net | 966.11 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | a59185fe1b9ccf21323f581d7477573f | 65.468863 | -165.473127 | 1 | 2 |
| 2 | 04-01-2019 22:37 | Wisozk and Sons | misc_pos | 49.61 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | 86ba3a888b42cd3925881fa34177b4e0 | 65.347667 | -165.914542 | 1 | 2 |
| 3 | 04-01-2019 23:06 | Murray-Smitham | grocery_pos | 295.26 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | 3a068fe1d856f0ecedbed33e4b5f4496 | 64.445035 | -166.080207 | 1 | 2 |
| 4 | 04-01-2019 23:59 | Friesen Lt | health_fitness | 18.17 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | 891cdd1191028759dc20dc224347a0ff | 65.447094 | -165.446843 | 1 | 2 |
| 5 | 05-01-2019 03:15 | "Raynor, Reinger and Hagenes" | gas_transport | 20.45 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | ef010a5f4f570d306a050a368ee2729d | 64.088838 | -165.104078 | 1 | 2 |
| 6 | 05-01-2019 03:21 | Heller-Langosh | gas_transport | 18.19 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | 8e2d2fae5319d31c887dddbc70627ac4 | 63.917785 | -165.827621 | 1 | 2 |
| 7 | 05-01-2019 11:31 | Padberg-Welch | grocery_pos | 367.29 | Browning | MO | 40.0290 | -93.1607 | 602 | Cytogeneticist | 14-07-1954 | 5fbe827807ec9f557f6242bb48db0e51 | 39.167065 | -93.705245 | 1 | 2 |
| 8 | 05-01-2019 18:03 | McGlynn-Heathcote | misc_net | 768.15 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | fba83e0a3adb530251295ab72a96b719 | 64.623325 | -166.403973 | 1 | 2 |
| 9 | 05-01-2019 22:02 | Dooley-Thompson | misc_net | 849.49 | Wales | AK | 64.7556 | -165.6723 | 145 | "Administrator, education" | 09-11-1939 | b87c92d4824758e704da572891697fed | 65.266065 | -164.865352 | 1 | 2 |
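
The "# duplicates" column above counts how many times each fully identical row occurs. A minimal, dependency-free sketch of that kind of tally follows. It uses only four of the fifteen columns, with values copied from the table, and repeats two rows by hand to mimic the duplication. The helper name `duplicate_summary` and the toy `rows` list are illustrative assumptions, not part of the report or of ydata-profiling, and the report's own counting logic may differ.

```python
from collections import Counter

# Toy sample (assumption): four columns only, values copied from the table
# above. The first two records are repeated to mimic the flagged duplicates.
rows = [
    ("04-01-2019 00:58", "Stokes, Christiansen and Sipes", 14.37, "a3806e984cec6ac0096d8184c64ad3a1"),
    ("04-01-2019 15:06", "Predovic Inc", 966.11, "a59185fe1b9ccf21323f581d7477573f"),
    ("04-01-2019 00:58", "Stokes, Christiansen and Sipes", 14.37, "a3806e984cec6ac0096d8184c64ad3a1"),
    ("04-01-2019 15:06", "Predovic Inc", 966.11, "a59185fe1b9ccf21323f581d7477573f"),
    ("22-01-2019 00:37", "Wiza LLC", 37.92, "a98a9e2ca6a7c605c34a4298be3ad606"),
]


def duplicate_summary(records):
    """Return (record, count) pairs for records seen more than once,
    most frequent first (hypothetical helper for illustration)."""
    counts = Counter(records)  # identical tuples hash to the same key
    dupes = [(rec, n) for rec, n in counts.items() if n > 1]
    dupes.sort(key=lambda item: item[1], reverse=True)
    return dupes


summary = duplicate_summary(rows)
for rec, n in summary:
    # Print the occurrence count next to the trans_num of the repeated row.
    print(n, rec[3])
```

Because `trans_num` looks like a per-transaction identifier, rows that repeat it along with every other field are probably repeated records rather than distinct transactions, though that is worth confirming against the data source before dropping them.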